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Oh 

(~! i Abstract 
"t^ ■ 

^ ' Suppose that we are given a function / : (0, 1) (0, 1) and, for some unknown p e (0, 1), a 

sequence of independent tosses of a p-coin (i.e., a coin with probabihty p of "heads"). For which 
functions / is it possible to simulate an /(p)-coin? This question was raised by S. Asniussen 
and J. Propp. A simple simulation scheme for the constant function f{p) = 1/2 was described 
^ ' by von Neumann (1951); this scheme can be easily implemented using a finite automaton. We 

CO . prove that in general, an /(p)-coin can be simulated by a finite automaton for all p € (0, 1), if 

' and only if / is a rational function over Q. We also show that if an /(p)-coin can be simulated by 

a pushdown automaton, then / is an algebraic function over Q; however, pushdown automata 
' can simulate /(p)-coins for certain non-rational functions such as f{p) ~ y/p. These results 

ff^ . complement the work of Keane and O'Brien (1994), who determined the functions / for which 

' an /(p)-coin can be simulated when there are no computational restrictions on the simulation 

fH , scheme. 
> 

s 



1 Introduction 



> 

^ ' Fifty years ago, von Neumann jl8j suggested a method to generate unbiased random bits from a 
■ sequence of i.i.d. biased bits. This method can be easily implemented using a finite automaton. 

In this paper we study the following generalization. Let V C (0, 1). Suppose that we are given 
a function / : 2? ^ (0, 1) and, for some unknown p £ T>, a sequence of independent tosses of a 
p-coin (i.e., {0, 1} valued random variables with mean p). For which functions / is it then possible 
to simulate an /(p)-coin? 

The allowed simulation schemes apply a stopping rule to independent tosses of a p-coin, and 
then determine a {0, l}-valued variable with mean f{p) as a function of the stopped sequence. We 
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emphasize that the scheme cannot depend on p. We are especially interested in simulation schemes 
that can be implemented by an automaton that receives the p-coin tosses as inputs, and outputs 
an /(p)-coin; see §1.2 for more formal definitions. 

A special case of this question was raised in 1991 by S. Asmussen (see |1U|). We learned of 
the general problem from J. Propp (personal communication) who emphasized its computational 
aspects. The problem was considered in the context of Markov chain simulation by Glynn and 
Henderson |Sj. 

In our main result we prove 

Theorem 1.1. Let T> C (0, 1) and / : 2? — > (0, 1). Then an f{p)-coin for p £ T) can he simulated 
using a finite automaton from independent tosses of a p-coin, if and only if f is the restriction to 
V of a rational function F over Q, such that < F{x) < 1 for < x < 1. 

Later using the result of the appendix we prove: 

Theorem 1.2. Let f : (0, 1) — > (0, 1). Lf an f{p)-coin can be simulated from tosses of a p-coin by 
a pushdown automaton for all p £ (0, 1), then f is an algebraic function over Q; i.e., there exists 
a non-zero polynomial P G Q[X, Y] such that P{f{p),p) = for all p G [0, 1]. 

In §3.2 we describe pushdown automata that can simulate /(p)-coins for certain non-rational 
functions such as f(j)) = ^/p. 

Our results complement the work of Keane and O'Brien (1994), who considered the simulation 
problem without any computational restrictions. They showed that the functions / : 2? — > (0, 1) for 
which an /(p)-coin can be simulated (in principle) are precisely the constants, and all continuous 
functions on D that satisfy min(/(x), 1 — f{x)) > min(j;, 1 — x)"" for some n > 1 and all x S P. 

1.1 Examples 

Here are some examples - see Figure 1. 




Figure 1: Simulating an unbiased coin, and ■i,fi_ vi 



• f{p) = 1/2. This is achievable by Von Neumann's trick: toss the p coin twice and let x and 
y be the outcome. If xy = 01 output 0, if xy = 10 declare 1; otherwise, do not declare and 
toss again. This is the leftmost automaton. 
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p-coin twice. If it's 11 declare 1, otherwise declare 0. This is the 
automaton in the middle. 

2 

• fip) — pi^fi-pyi ■ Toss the p-coin twice until you get 00 or 11. In the first case, declare 0, in 
the second declare 1. This is the automaton on the right. 

• f{p) — y/P- Theorem |T3 implies that this function cannot be simulated by a finite automaton. 
In Section 121 we construct a pushdown automaton which simulates /. 

• Our main result implies that there is no infinite set T> such that a 2p-coin can be simulated 
by a finite automaton for all p £ D from tosses of a p-coin. 



1.2 Formal definitions 

Denote by {0, 1}* the set of all finite binary strings, and call any subset of {0, 1}* a language. Say 
that a language L has the prefix property if there is no pair of distinct strings u,v £ L such that u 
is a prefix of v. 

For any binary string w, write Pp[iL'] = p"i("')(l — p)'"o{«')^ where ni{w) is the number of i's in 
w and let Pp{L) = X^^„g^ Pp(^) for language L. 

Definition 1.3. Let f -.V ^ [0, 1] where V C (0, 1). 

• A simulation of f is a pair of disjoint languages Lq,Li C {0,1}* such that Lq U Li has 
the prefix property and for all p £ D we have Pp[Li] = f{p) and Pp[io U Li] = 1 and 
Pp[L,]=f{p). 

• A simulation of / by a finite automaton, is a simulation (Lq, Li) of f such that there exists a 
finite automaton which outputs for the language Lq and outputs 1 for the language Li. An 
analogous definition applied to simulation by a pushdown automaton or a Turing machine. 

Definition 1.4. A finite automaton is defined by 

• A set of states S with a start state sq € S. 

• An alphabet which we fix to be {0, 1}, 

• A transition function 5 : S x {0, 1} — > S, 6{s,b) is the state the automaton is, given that it 
was in state s and the current input symbol is b. For a string w = wi . . .Wj^ G {0, 1}* we let 
(5(s, w) be defined inductively by 5{s, wi . . . Wk) = &{5{s, wi),W2 ■ . . Wk). 

• Two disjoint sets of final states Sq and Si. The automaton stops whenever it is in s £ SqUSi 
(formally, for all s £ SqD Si and b £ {0, 1}, it holds that 5{s,b) = s). If it stopped at Sq, 
then the output is 0; if it stopped in Si, then the output is 1. 

This is a slight variation of the standard definition (see e.g. (HI) where the automaton stops 
when the it gets to the end of the input. Here, the input is unbounded. 

Letting Lj be the strings in {0, 1}* for which the automaton stops at Si, the automaton will 
define a simulation of a function / : (0, 1) — > (0, 1), if Pp[Lo U Li] = 1. Note that Lq and Li are 
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both regular languages, and Lq U Li has the prefix property. Moreover, for every pair of disjoint 
regular languages Lq and Li with the prefix property, using a standard product construction, it 
easy to write down an automaton as in Definition 11.41 which outputs for Lq and 1 for Li. 

Note that in Figure 1, for the leftmost automaton, Lq = (00 + 11)*01, Li = (00 + 11)* 10; for the 
middle automaton, Lq = 11(0 + 1)*, Li = (00 + 01 + 10) (0 + 1)* ; and for the rightmost automaton, 
Lq = (01 + 10)*00, Li = (01 + 10)*11. 

The definition below is a slight variation of the standard definition of Pushdown automata (see 
e.g. [S] for the standard definition). 

Definition 1.5. A Pushdown automaton over the alphabet {0, 1} is defined by 

• A set of states S with a start state sq & Q. 

• A stack alphabet A. 

• A stack T £ A* which is initialized as the non-empty string r. 

• A transition function {6,r]) : S x {0,1} x A ^ 5 x A*. The first coordinate, 6{s,b,T), is the 
new state of the automaton given that it was in state s, the current input symbol is b and that 
the symbol at the top of the stack is r. After this transition the symbol at the top of the stack 
is replaced by the string in the second coordinate of r](s,b,T). 

• Two disjoint sets of final states Sq and Si. The automaton stops when the stack is empty. 
If the automaton stops at a state of Sq, the output is 0, and if it stops at a state of Si, the 
output is 1. 

Note that if Lq (Li) are the languages where the automaton output (1), then Lq and Li are 
context free languages, and Lq U Li has the prefix property. 

1.3 Motivation and related models 

Our results have a similar flavor to classical results of the Chomsky Schiitzenberger theory [3] - thus 
a relationship is established between the strength of a computation model and algebraic properties 
of the outputs generated by this model. We discuss some of the connections in more detail in 
subsections 13.11 and 14.11 

The model we introduce in this paper has also some interesting relationships with the theory of 
computability E] as in both models the interest is in "real" inputs and outputs. In particular 
for both models, the input is unbounded. See Subsection 14.21 for a more formal discussion. 

Finally a major motivation for studying this model is the theory of exact sampling, see e.g. 
|21 Q I15| llfij . In the theory of exact sampling the aim is to sample (exactly) from a certain 
complicated distribution given a simpler given distribution. Thus the problem is a basic problem 
in the theory of exact sampling. In Subsection 14.31 we give a (toy) example which illustrating how 
our results may be used for exact sampling. 
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1.4 Paper plan 



In Section[21we prove Theorem 12 . 21 and its generalizations to collections of coins, dice etc. In Section 
121 we prove Theorem 11.21 and show how using a push-down automaton, it is possible to simulate 
non-rational constants and functions. In Section 0] we discuss the relationships with the theory of 
languages and their formal power series, the theory of computability, exact sampling and suggest 
some open problems. 

2 Rationality and finite automata 

A sub-family of the coins which can be simulated via finite-automata are those which can be 
simulated via blocks. Von Neumann's trick consists of reading 2 bit blocks until 01 or 10 are 
reached, and then deciding for 01 and 1 for 10. Block simulation is a generalization of this 
procedure defined as follows 

Definition 2.1. A block simulation of f , is a simulation of f of the following form. Let Aq and 

A\ he disjoint subsets of {0,1}^, and A' = {0,1}^ \ {Aq U ^i). The simulation procedure has 

^0 = (E«,gA' ^)*(Ei«gA) ^1 = (Y^weA' ^TiYlweAi other words, the procedure reads 

a k bit string w. If w £ Aq, the procedure outputs 0, if w £ Ai the procedure outputs 1; otherwise 
the procedure discards w and reads a new k bit string. 

Below we prove the following theorem which immediately implies Theorem ll.il 
Theorem 2.2. Let D C (0, 1). For / : "D ^ (0, 1) the following are equivalent. 

I f can be block simulated. 
II f can be simulated via a finite automaton. 

Ill f is the restriction to T) of a a rational function F over Q such that < F[p) < 1 for all 
(0,1). 

Note that I ^ II is trivial. The implication // => III is fairly easy. We know of no way of 
proving II ^ I directly. Instead, we prove III =^ /. We know of no simple bound in terms of / 
on the size of automaton (or block) needed in order to simulate /, although there exists a simple 
algorithm for constructing an automaton that simulates /. 

2.1 Finite automaton =^ Rationality 

Proposition 2.3. Let P C (0, 1). If a finite automaton S simulates f : T> ^ (0,1), then f is the 
restriction toT> of a rational function F over Q such that < F{p) < 1 for all p G (0, 1). Moreover 
ifT, has n states, then F{p) = g{p)/h{p), where g{p), h{p) G and g and h are of degree at most 
n. 

The proof applies the maximum principle for harmonic functions on directed graphs. 
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Lemma 2.4. Let T, be a finite automaton and R a set of states such that for all s £ S, there exists 
w G {0,1}*, such that 6{s,w) £ R. Let < p < 1 and / : 5 — > M 6e harmonic, so for all s it 
holds that f{s) = pf{6{s, 1)) + (1 — p)f{6{s,0)). Then f achieves its maximum and minimum in 
R. Moreover, given the values of f in R, the values of f in S are uniquely determined. 

Proof. Note first that the last assertion follows from the first one, as if /i and /2 are two harmonic 
functions which have the same value on R, then /i — /2 is harmonic and has the value on R, 
which implies by the first assertion that max/i — /2 = min/i — /2 = 0, or /i = f2- 

In order to prove the first assertion, let m = max/. Note that if f{s) = m, then /(5(s,0)) = 
f{5{s, 1)) = m. Letting w G {0, 1}* be such that 5{s,w) G R, we obtain f{6{s,w)) = m as needed. 
■ 

Proof of Proposition 12. 3t Suppose that / can is simulated by a finite automaton S. Let S' 
be the set of states s such that there exists w G {0, 1}* with 6{so,w) = s. Clearly, we may remove 
from the automaton all the states not in S' (redefining 6 by restriction) and still obtain a finite 
automaton which simulates f{p). By the assumption that Pp[Lo U Li] = 1, it follows that for all 
s G 5', there exists w G {0, 1}* such that 6{s, w) £ SqU Si. From now on we assume that 5 = S'. 

Note that since for all s G 5, there exists w G {0, 1}* such that 5{s,w) £ SqU Si, it follows 
that Pp[Lo U Li] = 1 for all < p < 1. Therefore the function / is the restriction of the function 



Suppose that F{s;p) satisfies F{s;p) = for s G Sq, and F{s;p) = 1 for s G Si. For all other s, 
assume that 



By Lemma 12.41 the function F{s;p) is uniquely determined by these equations. This implies 
that F{sq;p) = F{p). Since F{p) is uniquely determined by a collection of linear equation with 
coefficients in it follows by Cramer's rule that F{p) may be written as the ratio of two 

determinants in and therefore F{p) = g{p)/h{p) £ Q{p), where the degrees of g and h are at 
most the number of states of the automaton, as needed. ■ 

2.2 Block Simulation 

In this subsection we study what can be simulated by blocks. 

Proposition 2.5. / can be simulated using a block procedure if and only if f can be written as 
D[p) / E{p) where 



=Pp[Li] top. 



F{s;p)=pF{8{s, l);p) + {I - p)F{5{s,^)-p) . 



k 




(1) 



i=0 



k 



E{p) 



p) 



k—i 



i=0 



and for all i, the coefficients di and ej are integers such that < di < Ci. 
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Proof. Suppose that / is block simulated. For a string w £ {0, 1}*, we write ni{w) for the number 
of Is in u;. Then 

where 

di = #{w £ Ai : ni{w) = i}, 

and 

ei = £ AiU Aq : ni{w) = i}, 

satisfy that < < Cj, as needed. 

For the other direction, suppose that f{p) = D{p)/ E{p), where D and E satisfy Let r be a 
number such that Cj < (^) (^^') for all i. For each fix a bijection i?j from 

{-u; e {0, 1}'' : niH = i} x {v £ {0, 1}^'' : ni(z;) = r} 

to{i,...,(?)r;)}. 

The sets ^-iid are subsets of {0, 1}*^ x {0, 1}^^ , defined as follows. Ai is defined as 
L)'^^q{{v, w) : ni{v) = i, ni{w) = r, and Bi{v, w) < di} 



and Aq as 



U^=o{(^' • = h ni{w) = r, and di < Bi{v, w) < ei}. 



So 



k—i 



j=0 
k 



i=Q 



and therefore, 



as needed. 



2.3 Rationality ^ Finite automaton 

In this section we prove Theorem 12. 21 The proof is based on a beautiful theorem by Polya (see 
jH], 57-59). We let A* denote the open s-simplex of probability distributions, 

s+l 

A^ = {pG(0,l)^+^:^p, = l}. 

i=l 
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Theorem 2.6 (Polya Let f : A^~^ be homogeneous and positive polynomial in the 

variables pi, . . . ,Ps. Then for all sufficiently large n, all the coefficients of {pi+. . ■+Ps)" f{pi, ■ ■ ■ ,Ps) 
are positive. 

Lemma 2.7. Let f : (0, 1) — > (0, 1) be a rational function. Then there exist polynomials d and e 

k 

d{p) = Y.d,p\l-pf-\ (2) 

k 

e{p) = 5]ey(l-p)'=-\ 

1=0 

where for all i, the coefficients di and Cj are integers such that < di < Cj, and f{p) = d{p)/e{p). 

Proof. As f{p) is a rational function it may be written in the form D{p)/ E{p), where D(p) £ 7j[p] 
and E(p) G Z[p] are relatively prime polynomials. Since < f{p) for all < p < 1, it follows that 
D(p) and E{p) do not change sign in the interval (0, 1). Without loss of generality we assume that 
D{p) > and E{p) > for all p £ (0, 1). Note furthermore that if D{p) = Yli=o ^iP^ ^{p) = 
X]i=o ^iP^^ then we may define homogeneous polynomials D{p, q) and E{p, q) of degree fc, by letting 
D{p,q) = T!1=o ('^P'iP + and E{p, q) = Y!1=o b^P'{p + qf~'- Note that D{p) = D{p, 1 - p) 

and E{p) = E{p, 1 — p). Let us rewrite, 

k 

D{p,q) = Y.d^P'Q"-', (3) 

i=0 
k 

E{p,q) = Y.^^p'q^-\ 

1=0 

The polynomials D{p, q), E{p, q) and E{p,q) — D(p,q) are all positive homogeneous polynomials. 
Therefore by Theorem l2.61 if follows that there exists an n such that letting d{p, q) = {p+q)'^D{p, q) 
and e{p, q) = {p + q)^E{p, q), the polynomials d,e and e — d all have positive coefficients as polyno- 
mials in p and q. Writing f{p) = d{p, 1 — p)/e{p, 1 — p) we obtain the required result. ■ 

Proof of Theorem 12.21 The implication L ^ II is trivial, the implication // ^ III follows 
from Proposition 12. .SI while /// =^ / follows from Lemma 12.71 together with Proposition 12.51 

2.4 Extensions to dice and other fc-sided coins 

In this subsection we discuss generalizations of the problem to A;-sided coins, such as dice. A 
simulation of / = (/i, . . . , /t) : A'^ — > A*, is a collection of t disjoint languages property (Li, . . . , Lt) 
over the alphabet S = {1, . . . , s}, such that ul^^Li has the prefix property and Pp[-Li] = fi{p) for all 
p £ A^. The definition of simulation via finite/pushdown automata and Turing machines naturally 
extend to this setting. The continuity results of extend to the more general setting as well. 

Proposition 2.8. // a finite automaton simulates / : A'' — > A*, then f is a rational function over 
Z (i.e. fiip) is a rational function over Z for all 1 < i < t + 1). 
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Proof. Repeat the proof of Proposition 12.31 for each of the /j's. ■. 

Repeating the proof of Theorem 12.21 we see that 
Theorem 2.9. Any rational function f : ^ can be simulated via blocks. 

From which we conclude that 
Corollary 2.10. Any rational function / : A* ^ A* can be simulated via blocks. 

Proof. The proof is by induction on t. The case t = 1 is covered by Theorem 12.91 Suppose t > 1, 
and let (/i, . . . , /j+i) : A'* — > A* be a rational function. By Theorem 12.91 there exists a block 
simulation {Ai,A2) C (S*^)^ for — /i), and by the induction hypothesis there exists a block 

simulation (Bi, . . . , 5^) C (S^* for (^/(l - /i), . . . , ft+i/{l - fi))- Taking 

(Ai X S^ X Si, X i?2, . . . , ^2 X Bt) C 

we obtain a block simulation for (/i, . . . , ft+i) as needed. □ 

3 Pushdown automata 

We now prove Theorem ll.21 We begin by showing that if / is simulated by a pushdown automaton, 
then / is the unique solution of a set of polynomial equations. We then invoke the results of the 
appendix to deduce that / is an algebraic function. In Section |31 we also construct pushdown 
automata which simulate non-rational functions such as f{p) = .^/p. We don't know if every 
algebraic / : (0, 1) (0, 1) can be simulated by a pushdown automaton. 



3.1 Pushdown automata and algebraic functions 

The Chomsky-Schiitzenberger theory implies that if Lq and Li are languages which are generated 
by unambiguous grammars and f{p) = Pp[Li], then f{p) is an algebraic function. In this subsection 
we aim to prove algebraic properties of / even when Lq and Li are inherently ambiguous. 

Suppose S is a pushdown automaton which simulates a function /. Call {b, s) G A x S good, if 
when the automaton is at state s and the stack is bw (where b is at the top) , then with probability 1 
at some point the stack will be w. Call (6, s) bad otherwise. By the assumption that Pp[LoULi] = 1, 
it follows that starting at {sq,t) it is impossible for the automaton to reach a state s with bw at 
the top of the stack, where {b,s) is bad. Thus we can redefine all transitions {b,s) — > {b'uj',s'), 
where {b',s') is bad, in an arbitrary manner, and still obtain Pp[Lo U Li] = 1. Therefore, without 
loss of generality we may assume that all {b, s) £ A x S are good. 

Let a{p; b,s,s'), a : [0, 1] x A x S x S ^ [0, 1] be defined as follows. For vu £ A*, let a{p; b, s, s') 
be the probability that given that currently the automaton is at state s and has in its stack bw 
(where b is at the top), at the first time that the content of the stack will be w, it will be at state 
s'. It is easily seen that a{p;b, s, .s') is well defined (does not depend on w). Moreover, by the 
assumption that all (6, s) are good, it follows that J2s'€SiUS ^{P'^^i^i^') — 1- 
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We extend the definition of a to a(p; ti, s, s'), a : A* x 5 x 5 ^ [0, 1], wliere a(p; n, s, s') is tlie 
probability that given that currently the automaton is at state s and has in its stack uw (where u 
is above vS)^ at the first time that the content of the stack will be it will be at state s' . Note 
that \i w = w\ . . .Wr^ then 

r 

a{p; w, s, s') = X] n "(P' 

i=l 

where the sum is over all 

(Si, S2, . . . , Sr, Sr+l) G {«} X ^''^^ X {s'}, 

and if e denotes the empty word, then 

Vs, s' eQ : d{p; e, s, s') = l{s=s'}- (5) 

Note that if r is the initial word at the stack, then 

= X] «(?';'^'^o,s')- (6) 

s'eSi 

Therefore if we could prove algebraic properties of the functions a{p;b, s, s'), we will deduce 
algebraic properties of /. 

Claim 3.1. For all < p < 1, a{p; •,-,•) : A* x S x S ^ M. is the unique bounded solution of the 
equations 

VifJ G A*, V6 G A, Vs, s' G (5 : d{p;bw, s, s') = pd [p]6{s,l,b)w,ri{s,l,b),s') (7) 

+ (1 - P)a [p; 5{s, 0, b)w, r]{s, 0, b), s') , 



Vs,s' G Q : a(p;e, s,s') = l{s=s'} (e i/ie empty word). (8) 

Proof. By linearity, it suffices to prove that the zero function is the only bounded solution to (O 
with the boundary conditions 

Vs,s' G Q : d{p;e,s,s') = 0. (9) 

Fix (wjs) G A* X Q. We will show that d{p;w, s, s') = 0. Consider the random walk {Wt,St)w,s 
defined on the graph A* x Q, where {Wq, So)w,s = {wjs). Given Wt = BU, where i? G A, the 
conditional probabilities for {Wt+i, St+i) are given by 

, . ^ ( {S{St,l,B)U,r]{St,l,B)) with probability p, , . 

\^vvt+i,ot+i) ]^ ^s^St,0,B)U,ri{St,0,B)) with probability 1 - p. ^ ' 

If Wt is the empty word, then we let {Wt+i, St+i) = {Wt,St). By definition, for all s' G Q, the 
process d(j>;Wt, St, s') is a bounded martingale. The assumption that the pushdown automaton 
stops a.s. implies by Q that the martingale converges to a.s. (and therefore in Li). We therefore 
conclude that d(p; Wt, St, s') is identically as needed. ■ 
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Recall that given that the current state is s, and the top of the stack is b, with probability p 
the automaton will move to state si = 6{s, 1, b), and instead of 6, the top of the stack will contain 
rj{s, 1, 6) = ci . . . Cr] with probability I — p the automaton will move to state si = 6{s, 0, b), and 
instead of b, the top of the stack will contain r]{s, 0,b) = ci . . . Cf. 

We can therefore write 

a{p;b,s,s') = p (l{r=o}l{si=s'} + l{r>o}a(p; ci • • • Cr, si, s')) (H) 

+ (1 - P) (l{f=0}l{si=^i'} + l{f>0}«(p; Ci . . . Cf , si,s')) , 

Or 

a{p;b,s,s') = p (^{r=o}'^{si=s'} + '^{r>o}'^]Xo:{p] Ci, Si, Si+i)^ (12) 

+ i'^-p) (^{f=0}'^{si=s'} + '^{r>0}'^]^a{p;Ci,Si,Si+i)^ . 

where the first (second) sum is taken over all 

{Si, . . . , Sr+l) G {Si} X 5''^^ X {s'}, 

(si,...,v+i) G {si} X 5''-^ X {s'}. 

Note that (fT^ defines a set of algebraic equations in p and {a{p; b, s, s')}6,s,s'- 

Claim 3.2. For allO < p < 1, there is a unique positive solution a{p; •,-,•) : [0, 1] x A x x — > M, 
to equations il^) and 

V6 G A, Vs e Q : ^ a{p; b, s, s') = 1. (13) 

s'eQ 

Proof. In order to prove the claim it suffices to show that each positive solution to (|12j) and (|1.S|) 
defines a positive bounded solution to ((TJ and Q via (0J) and 

It is immediate to see that ()12() implies ((T)). Moreover, H13() implies by ^ that for all w G A* 
and s G Q, it holds that 

^ d{p;w,s,s') = 1. 

s'GQ 

and so a is a bounded function as needed. ■ 

Proof of Theorem 11.21 The result follows immediately from Claim 13.21 by Theorem lA.ll ■ 

3.2 Pushdown automata which simulate non-rational functions 

In this subsection we construct a pushdown automaton which simulates a non-rational function. Let 
g : (0, 1) — > (0, 1/2) be a rational function. We'll construct a pushdown automata which simulates 

the function 7(p) = • 

Taking g{p) = (1 — p)/2, we obtain j{p) = (1 — y/p)/{l —p)- thus using a product construction, 
it is easy to construct a pushdown automaton simulating the function f{p) = y/p. 
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Consider a random walk on the ladder graph N x {0, 1} where an edge {{x, y), {x' , y')) is present 
if X = x' and \y — y'\ = 1, or |a; — x'| = 1 and y = y' . The random walk moves to the left (right) 
with probability 1 — 2g{p), and up (down) with probability g{p). Let 7 be the probability that 
starting at (0, 1) the first hitting point of the random walk at level ({(0, 0), (1, 0)}) is (0, 0). It is 
easy to see that 

7 = 5(p) + (l-2<7(p))(l-7)+5(p)(7' + (l-7)') 
= 1-7 + 25(^)72, 

and therefore 



1 - V'l - 2g{p) 

^(f' = — m) — ■ 

It is easy to simulate the random walk with a pushdown automaton. The stack alphabet is {x} 
where correspond to level n of the ladder - the initial word at the stack is x. 

Assume first that the input alphabet is {0, 1,2} where the probability that letter 1 appears is 
1 — 2g{p), and the probability that letter or 2 appears is g{p). 

Let So and si be two states of the automaton corresponding the left ({0} x N) and right ({1} x N) 
of the ladder. Reading the symbol 1 will correspond to a transition from sq to si or vice-versa 
without changing the content of the stack. Reading the symbol 2 at state Si will result at staying 
at state Si and pushing an x to the stack. Reading the symbol at state Sj with x at the top of the 
stack, will result at staying at state Si while popping x from the stack. In this way it is possible to 
simulate the random walk given an infinite sequence of {0, 1, 2} symbols with {g{p), 1 — 2g{p), g{p)) 
bias - and therefore toss a coin with bias (|14() . 

In the general case where we are given an infinite sequence of {p, 1 — p) bits, we use block 
constructions of Section|21in order to generate a sequence of {g{p), 1 — 2g{p), g{p)) {0, 1, 2} variables 
together with the above construction in order to obtain the required result. 

Remark 3.3. Similarly one may construct a pushdown automaton associated with a random walk 
on the ladder where the probabilities of going {up, left-right, down) are given by {g{p), 1 — g{p) — 
h{p),h(p)). Note however that in this case Pp[LQ U Li] = 1 iff the random walk is recurrent iff 
h{p) > g{p)- Thus, unlike finite automata, there exist pushdown automata which define a valid 
simulation only for a proper subset of the interval (0, 1). 
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4 Related models and open problems 



4.1 Chomsky-Schiitzenberger theory 

In the seminal paper |3| titled "The algebraic theory of context-free languages" by Chomsky and 
Schiitzenberger, the authors discuss many beautiful relationships between different types of lan- 
guages and their generating functions. The generating function of a grammar generating a language 
L is defined as / = ^^^^i^ n(w)w, where n{w) is the number of derivations of the word w in the 
grammar, and / is viewed as a formal power series in the non-commutative variables and 1. The 
results of [3] imply in particular that the generating function of a regular language is rational, which 
implies Proposition 12.31 - simulation via finite automata always yield rational functions. 

In [1] it is also proven that the generating function of a context-free language is algebraic. 
However, this does not imply that coins tossed via pushdown automata are algebraic, as many of 
the context free languages are inherently ambiguous and for such languages, the non-commutative 
power series P{L) = X^^gjg ly l{i«gL}^ is not algebraic, (see e.g. 0; as Larry Ruzzo kindly noted, 
there are also context free languages with the prefix property that are inherently ambiguous). 

Thus, while Proposition l2.3l could be obtained by projecting the Chomsky-Schiitzenberger results 
from the non-commutative setting to the commutative setting, an analogous result for pushdown 
automata cannot be obtained in a similar way. Theorem 12.21 mav be interpreted as a commutative 
inverse to the Chomsky-Schiitzenberger result for regular languages, where additional positivity 
restrictions are imposed. 

4.2 Theory of computabihty and Turing machines 

A distinctive feature of the computation models discussed in this paper is that they have an 
unbounded input, unlike the classical Turing machine for which the input is of bounded length. 

In (see JSl for background), a computational model in analysis is introduced. This model 
has some common features with the model introduced here - in particular, the input is given as an 
unbounded sequence. However, the models are different as our model has an input drawn according 
to an i.i.d. distribution. Moreover, in our model we are looking for a coin which has exactly the 
correct distribution f{p), while in [7j the aim is find a Turing machine which for an (unbounded) 
input X, and an error bound e computes an approximation of the function /(x) within error margin 

There are, however, some striking similarities between the two models. For instance the conti- 
nuity results of ^01 should be compared with [2j where it is shown that computable functions are 
continuous. Moreover it is easy to adapt the proofs of ^0] in order to show that if / : (0, 1) (0, 1) 
is computable and polynomially bounded from and 1 at and 1, then it possible to simulate the 
function /(p) via a Turing machine. 

4.3 Exact samphng 

The theory of exact sampling (see e.g. P|[nil5|ll6j) deals with simulating a complicated probability 
measure using a simple one. Usually, the simple measure consists of a sequence of unbiased bits or 
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a a sequence of uniform variables. 

In this paper both probabihty measures of interest (the p coin and the f{p) coin) are simple. 
The difficulty here is that p is unknown. It is easy to estimate p, and therefore f{p)- However, to 
get a coin with an exact bias f{p) is harder. 

Yet, the problem studied here may be interpreted as a problem in exact sampling. Consider the 
following examples. 

• Suppose that some physical process produces percolation configuration on a grid Fi, where 
the probability of an open edge is p. We are interested in performing percolation on a grid 
where we want that the probability that an edge e is open in r2 to be equal to the probability 
that two vertices at distance 2 in Ti are connected by a path of length 2. Our results allow 
to use samples of the configuration Fi in order to produce samples for the process on 

• Let {xi}i>i be i.i.d. bits with unknown mean a. Suppose that we are given the products 
{yi}i>i, where yi = X2iX2i-i, and we want to simulate (one or more) i.i.d. bits {zj} with 
mean a. This can be done using the pushdown automaton in Subsection \'A.2\ that simulates 
the function ^/p. 

4.4 Some open problems 

Problem 4.1. Let f : (0, 1) — > (0, 1) be a rational function. What is the smallest size of an 
automaton that simulates f{p)? Is there an efficient algorithm for finding this automaton? 

This problem is potentially hard, as the size of the automaton probably depends on analytic 
and number theoretic properties of / (see jl4jl. 

Problem 4.2. Let f : (0, 1) — > (0, 1) be an algebraic function. Can f be simulated by a pushdown 
automaton? 

Acknowledgment: We thank Jim Propp for suggesting the problem. We are grateful to Omer 
Angel, Paul Beame and Larry Ruzzo for helpful remarks, and to Russ Lyons and Sergey Fomin 
for references. Most of this work was done while the first author was a postdoctoral researcher at 
Microsoft Research, and the second author was visiting there. 
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A Appendix on Algebraic functions 



Christopher J. Miliar ^ 
chillar@math.berkeley.edu 

The purpose of this note is to establish the following fact using techniques from real algebraic 
geometry. It will be a direct corollary of the more general Theorem IA.2I below. 

Theorem A. 1. Let {Fi{p,xi, . . . ,Xn)}iLi C Q[p, xi, . . . , and let S be the set in W^~^^ defined 
by 

{n m 
{p,xi,...,Xn) : 0<p<l A /\0<Xj<l A /\Fi{p,xi, . . . ,Xn) = 
j=i i=i 

Suppose that for each p G (0, 1), there exist a unique (ai, . . . , a„,) G M.^ such that {p,ai, . . . , a„,) G 
S; equivalently, S is given as the image of some function 4> : (0, 1) [0, 1]'"+^ with (pip) = 
{p,4>i{p)j ■ ■ ■ T4>n{p))- Then, there exist nonzero polynomials gj £ Q[X,Y] such that 

9j{P,4>j{p)) = 

for allp £ (0,1). 

We begin with an abstract setting. Let K he a field, and let P be a subset of K satisfying the 
following two properties: 

1. If X, y G P, then x + y and xy G P. 

1. K is the disjoint union of P, {0}, and —P. 

A subset P as above is called the set of positive elements, and we say that P is an ordering of K. 
A field K is then an ordered field if there exists an ordering of K. A real field is a field in which —1 
is not a sum of squares, and a real closed field i? is a real field such that such that any algebraic 
extension of R that is real must be equal to R. For example, both M and R f) are real closed 
fields (where Q*^ is the algebraic closure of Q). For ease of notation below, we set Q* = M n Q". 
Prom the definition, it is clear that any real field must have characteristic 0, and thus Q naturally 
embeds in any real field. It also follows that Q* is a subset of every real closed field. 

Any real closed field R has a unique ordering, and the positive elements are the squares of R. 
Moreover, every polynomial of odd degree in R[x] has a root in i? jl| p. 452]. In light of these 
observations, the axioms for the theory of real closed fields (RCF) consist of [3 p. 24]: 

1. the axioms for ordered fields; 

2. Vx > 3y y"^ = x; 

3. the axiom Vxq • • • Vx„_i 3y + Xn-iy"~^ + • • • + j;o = for each odd n > 0. 
^This work is supported under a National Science Foundation Graduate Research Fellowship. 
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The set S as in Theorem lA.ll is an example of a set definable by a Boolean combination of a 
finite number of polynomial inequalities and equalities (over Q). Such a set is called semialgebraic 
(over Q), and a function is called semialgebraic if its graph is a semialgebraic set. We may now 
state the main theorem. 

Theorem A. 2. Let R be a real closed field, and let A he a semialgebraic subset of R defined 
using polynomials {Hi{py\^^^ C Q[p]. Also, let S be a semialgebraic subset of R^^^ defined by 
{Fi{p,xi, . . . ^Xn)}^i C Q[p,xi, . . . Suppose that for each p £ A, there exist a unique point 

(ai,...,an) G R^ such that (p, ai, . . . , a„) G S; equivalently, S is given as the image of some 
function : A ^ R^'^^ with (j){p) = (p, (/)i(p), . . . , 0„(p)). Then, there exist nonzero polynomials 
gj £ <Q[X,Y] such that 

9j{P,(t>j{p)) = 

for all p £ A. 

A fundamental fact about RCF is that it is a complete theory ^ p. 19] in the sense that each 
first order sentence expressible in the theory of RCF is either true in every structure satisfying the 
RCF axioms or false in every such structure. For example, the sentence, Vx 3y y"^ = —x, evaluates 
to false for any structure (such as M) satisfying the axioms of RCF. As a standard application of 
completeness, we present the following 

Lemma A. 3. Assume the hypothesis of Theorem \A . Then, for each p £ AnQ* , we have that 
(pjip) £ Q* for allj £ {l,...,n}. 

Proof. To simplify notation, we write "p £ A", for example, in place of the Boolean combination 
of polynomial inequalities and equalities that defines A. As Q embeds in any real closed field, the 
sentence, 

yp3xi . . .3xn ^{p £ A) V {p,xi, . . . ,Xn) £ S, 

is a valid sentence in any structure satisfying the axioms of RCF. By completeness, it must have 
the same truth value in every real closed field. Since it is a true statement in R by assumption, it 
follows that it is also true in the real closed field Q*. Let p £ ACiQ* . Then, there exists a tuple, 
a = (oi, . . . ,a„) £ (Q*)", such that {p,ai, . . . ,a„) £ S. By the hypothesis, this a must be the 
unique tuple in i?" for this p, and hence for all j, <j)j{p) = Oj £ Q*, completing the proof. ■ 

A key result in the theory of RCF is the following theorem (which is essentially a restatement 
of the fact that RCF has quantifier elimination) p. 92]. 

Theorem A. 4 (Tarski-Seidenberg Theorem). The projection of a semialgebraic set is semial- 
gebraic. 

Corollary A. 5. The functions (pj are semialgebraic functions. 

Proof. Let T = {{p, xi, . . . , x^) ■ p £ A A (j>,xi, . . . , Xm) £ -S}, which is semialgebraic. Then, the 
image of the projection of T into R^ given by (p, ai, . . . , an) ^ (p, Oj) is also semialgebraic by the 
theorem. ■ 

Semialgebraic functions are well-behaved in the following sense (HI p- 17]. 
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Theorem A. 6. Let R be a real closed field. If T is a semialgebraic subset of and h : T ^ R is 
semialgebraic, then there is a nonzero polynomial (^(Xi, . . . , X„, 1") G . . . , X„, y] such that 

g(x, h{x)) = for all x E T. 

We remark that applying Theorem IA.6I with Corollary IA.5I already gives a result similar to 
Theorem IA.2I The only subtlety is that we would like the polynomial g to be in Q[A, y] instead 
of R[X, Y]. We are now ready prove Theorem IA.2I 

Proof of Theorem \A.Sl Fix j £ {1, . . . ,n}. We will apply Theorem IA.6I with R = Q* and T = 
AnQ*. Prom Lemma IA.31 it follows that (T) C Q* , and from Corollary IA.51 we have that 
is semialgebraic. Therefore, from Theorem lA. 61 there is a nonzero polynomial g{X,Y) € Q*[X, y] 
such that g{p, (t)j{p)) = for all p G j4nQ*. We will now produce a nonzero polynomial g E Q[X, Y\ 
with the same property. 

Consider the field, Q*(X), of rational functions in the variable X. View g{X, Y) as a polynomial 
in the variable Y over Q*(A), and, upon clearing denominators, let qi{X,Y) G Q*[X, y] {i = 
1, . . . , r) be the irreducible factors of g{X, Y) (over Q*(A)). It is clear that for all p € A n Q*, we 
have 

r 

\{qi{pAAp)) = ^- 

i=l 

Fix i and let ai, . . . , at G Q* be all of the coefficients in qi{X, Y). Extend Q(A) by these coefficients, 
so that Q(A)(ai, . . . ,at) is a finite extension of Q(A). Also, let Q(A)(ai, . . . ,at){y) be a finite 
extension of Q(A)(qi, . . . ,aj) defined by the equation qi{X,y) = 0. It follows that y is algebraic 
over Q(A), and upon clearing denominators, let qi{X,Y) £ Q[X,Y] be such that qi{X,y) = 0. 

Since both qi{X,Y) and qi{X,Y) have y as a root and since qi{X,Y) is irreducible, it follows 
that qi{X,Y) divides qi{X,Y). As both qi{X,Y) and qi{X,Y) are in Q*[X,Y], Gauss's Lemma jH 
p. 181] gives us that, 

hiX,Y) = qi{X.Y)/q,{X,Y) G Q*[A,y]. 

Let 

r 

~g{X,Y) = J{qi{X,Y)£Q[X,Y]. 

1=1 

We claim that g{X, Y) is our desired polynomial. But indeed, for all p G j4 n Q*, 

r 

9{p,4>j(.p)) = Y{qi{p,4>j{p))hi{p,(l)j{p)) = 0. 
1=1 

Finally, let A and R be as in the statement of Theorem IA.2| and let W be the graph of 
(f)j : A ^ R, which is a semialgebraic set (over Q). Consider the sentence, 

\/pya ^{{p,a) eW) V g{p,a) = 0, 

which is valid in any structure satisfying the axioms of RCF (again, since Q embeds in any real 
closed field). By the above argument, it is a true statement for Q*, and therefore, by completeness, 
it is also true for R. This completes the proof. ■ 

As a final remark, we note that the fussiness in the proof of Theorem I A . 21 was necessary to avoid 
division by zero after applying the substitution homomorphism with X t-^ p and Y i-^ (pj (p) . 
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